EuropSisches Patentamt 
European Patent Office 
Office europeen des brevets 



IllllllUllllPIIMIlIIIIIll 

(H) EP 1 253 516 A2 



(12) EUROPEAN PATENT APPLICATION 



(43) 


Date of publication: 


(51) intci 7; G06F9/50 




30.10.2002 Bulletin 2002/44 


(21) 


Annliration number 02009207 8 

/VkV VJ IIOC4 LI \J 1 1 1 IUI 1 lUWl . Vfc W^JfcV ff »w 




(22) 


Date of filing: 24.04.2002 




(84) 


Designated Contracting States: 


(72) Inventors: 




AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 


• Dorofeev, Andrei V. 




MCNLPTSETR 


Sunnyvale, CA 94086 (US) 




Designated Extension States: 


• Tucker, Andrew G. 




AL LT LV MK RO SI 


Mento Park, CA 94025 (US) 


(30) 


Priority: 25.04.2001 US 843426 


(74) Representative: HOFFMANN - EITLE 






Patent- und Rechtsanwalte 


(71) 


Applicant: Sun Microsystems, Inc. 


Arabellastrasse 4 




Palo Alto, California 94303 (US) 


81925 MUnchen (DE) 



(54) Apparatus and method for scheduling processes on a fair share basis 



(57) Described is a scheduling system that provides 
allocation of system resources of one or more processor 
sets among groups of processes. Each of the process 
groups is assigned a fixed number of shares, which is 
the number that is used to allocate system resources 
among processes of various process groups within a 
given processor set. The described fair share scheduler 
considers each processor set to be a separate virtual 
computer Different process sets do not share process- 
es, a particular process must execute on a single proc- 



essor set. In another embodiment of the invention, each 
p rocess group couid be given a separate number o f 
s hares fo r ea ch proc essor set. Percentage of the re- 
sources of the specific processor set allocated to proc- 
e sses of a process group is calculated as a ratio ofjth e 
shares of the process gr oup on the processor set to th e 
tot al number or sn ares of active process groups operat- 
ing in that set. The process group is considerea'acTive 
on a processor set, if that processor set executes at 
least one process of that process group. 
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Description 

FIELD 

[0001] The present invention generally relates to 5 
techniques for managing load of a central processing 
unit in a computer system. The present invention also 
relates to techniques for allocating central processing 
unit percentage usage among several processes. 

10 

BACKGROUND 

[0002] Modern high performance computer systems 
are capable of supporting multiple users and executing 
multiple processes or applications started by different is 
users at the same time. Term scheduling refers to allo- 
cation of central processor unit (CPU) resources among 
multiple processes owned by different users in a com- 
puter system. It will be appreciated by persons of skill 
in the art that the fraction of the total CPU resources 20 
allocated to a particular process may depend on the 
number of other processes in the system and a relative 
importance of the process in comparison with the other 
processes in the system. 

[0003] Anoth erwa y of allocating system resources i s 25 
to provide a process group, rath er than a singl e process , 
wit h a fixed shareof CPU resources . In other words, one 
or more processes executing on a central processing 
unit of a computer system can be combined into process 
groups, and each process group is allocated a share of 30 
the system resources that are distributed among indi- 
vidual processes of this process group. T he system re - 
sou rces can be distributed among_p j^o^sses^of-4he— 
procjELSs.g to upj n,an,eq ual.ocu nequal.mann e r. The proc- 
esses can be combined into aforementioned process 35 
groups based on various criteria, including, but not lim- 
ited to, user id of the user executing the process, group 
id of the user executing the process, or based on any 
other appropriate classification. The nature of the clas- 
sification is not critical to the present invention. 40 
[0004] M odern operating systems include .concept of 
process or set_s t _whichjs,a.grQjj p^of processes i executin g 
a specific^separateset of processes. O nly those pro c- 
esses that are explicitlyT>6T^ set are 

permittetTtc^be executed ^TTp roc^s^oTs^h^Belon g to 45 
th at^proces s-CJiseL- Presently, there exists no resource 
allocation scheme that would leverage advantages of 
the processor set architecture and have an ability to dis- 
tribute CPU resources among multiple process groups 
in a pre-defined manner. so 

SUMMARY 

[0005] It is an object of the invention to provide for im- 
proved allocation of system resources. 55 
[0006] According to an embodiment a method for al- 
locating a percentage of system resources among proc- 
ess groups in a computer system, said computer system 



comprising at least one central processing unit, said at 
least one central processing unit combined into at least 
one processor set, may comprise: a. assigning each of 
said process groups a number of shares for each or said 
at least one processor set; b. allocating said system re- 
sources of each of said at least one processor set to 
each of said process groups according to the number of 
shares assigned to said each of said process groups. 
[0007] Said system resources of each of said at least 
one processor set may be allocated based on a number 
of shares of all active groups within said each of said at 
least one processor set. 

[0008] Further, said percentage of said system re- 
sources may be calculated based on a ratio of the 
number of shares assigned to said each of said process 
groups to the a number of shares of all active groups 
within said each of said at least one processor set. 
[0009] Still further, each of said process groups may 
include only one process. 

[0010] A program may be provided having instruc- 
tions adapted to make a computer carry out at least one 
of the above operations. And, a computer readable me- 
dium may be provided, embodying the program. 
[001 1 ] According to another embodiment a computer 
readable medium may be provided, embodying a pro- 
gram for allocating a percentage of system resources 
among process groups in a computer system, said com- 
puter system comprising at least one central processing 
unit, said at least one central processing unit combined 
into at least one processor set, said program compris- 
ing: a. assigning each of said process groups a number 
of shares for each or said at least one processor set; b. 
allocating said system resources of each of said at least 
one processor set to each of said process groups ac- 
cording to the number of shares assigned to said each 
of said process groups. 

[0012] According to still another embodiment, a 
scheduler may be provide for allocating a percentage of 
system resources among process groups in a computer 
system with at least one central processing unit, said at 
least one central processing unit combined into at least 
one processor set, said scheduler comprising means for 
assigning each of said process groups a number of 
shares for each or said at least one processor set; and 
means for allocating said system resources of each of 
said at least one processor set to each of said process 
groups according to the number of shares assigned to 
said each of said process groups. 
[0013] Further advantageous features of the inven- 
tion are outlined in further claims. 

DESCRIPTION OF THE DRAWINGS 

[0014] Various embodiments of the present invention 
will now be described in detail by way of example only, 
and not by way of limitation, with reference to the at- 
tached drawings wherein identical or similar elements 
are designated with like numerals. 
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[0015] FIG. 1 illustrates exemplary allocation of sys- 
tem resources according to the inventive concept. 

DETAILED DESCRIPTION 

[0016] To overcome the limitations described above, 
and to overcome other limitations that will become ap- 
parent upon reading and understanding the present 
specification, apparatus, methods and articles of man- 
ufacture are disclosed that allocate a percentage of sys- 
tem resources among process groups in a computer 
system having one or more processor sets. 
[0017] According to the inventive method, processes 
are combined into process groups based on a pre-de- 
fined criteria. Each of the process groups is assigned a 
number of shares representing relative importance of 
the group within its processor set. The inventive system 
allocates the system resources of the processorsets to 
process groups according to the number of shares as- 
signed to a particular process group and the total 
number of shares of all active process groups in the 
processor set. A process group is considered active on 
a processor set if there is at least one process of this 
process group executing on that processor set. 
[0018] Fair share scheduling is a way to assign a par- 
ticular process a fixed share of CPU resources. The 
term share may be used to describe the relative impor- 
tance of one workload versus another. 
[0019] According to one of the aspects of the present 
invention, various processes in the system are com- 
bined into one or more process groups. These process 
groups users are assigned a number of shares which 
represent relative importance thereof. This is a way to 
guarantee application performance by explicitly allocat- 
ing shares (or percentage) of system CPU resources 
among competing workloads. Note that total number of 
shares assigned to all process groups need not be 100. 
Furthermore, to obtain the percentage of the system 
CPU resources available to a process group at each giv- 
en moment of time, a total number of shares allocated 
to that process group must be divided by the total 
number of shares possessed by all currently active proc- 
ess groups. A process group is considered active when 
it has at least one running or runnable process. Indeed, 
to ensure the complete, or 100% utilization of the sys- 
tem, only process groups which have executing proc- 
esses at a particular time should be given share of the 
CPU usage. Note that such active process groups are 
searched across the entire system. P^aj^j^v&Umer 
tne Pj*rcg BteggtSOb^ 

ess groups depends opthe ^.numberjpf^bares j ayfled by 
all ^^^I^^^locQss^gro yps in th gjsyslern^ o r the 
process groups that have at least one executing process 
in the system. Therefore, in a system where processes 
are combined into process groups based on user id, any 
new logged user with a given number of shares can de- 
crease the CPU percentage of all other actively running 
users. 



[0020] Modern operating systems, such as a Solaris 
Operating System distributed by Sun Microsystems, 
Inc. of Palo Alto, California have a concept of processor 
sets. Processor set concept applies to multiple proces- 
sor computers and allows the binding of one or more 
processors into groups of processors. Processors as- 
signed to processor sets will run only processes that 
have been bound to that processor set. In other words, 
the aforementioned processor set is essentially a virtual 
single- or multi-processor computer system within a 
physical computer, which has its own set of running 
processes. The concept of processor set is especially 
helpful, for example, when certain important process 
need to be provided with a separate one or more proc- 
essors. For example, in a computer system providing 
services to http clients, a separate processor set can be 
allocated to running a web server, while all other proc- 
esses can be executed on a second, separate proces- 
sor set. In this case, the amount of CPU resources allo- 
cated to the web server will not depend on the other 
processes executing in the system. 
[0021 ] However, when the aforementioned processor 
sets are used in conjunction with the conventional fair 
share scheduler, the performance of processes running 
on one processor set may be impacted by the work per- 
formed by processes running on another processor set, 
which is an undesirable effect. 
[0022] The reason why the existing fair share sched- 
uler does not work satisfactorily with processor sets is 
because that total number of shares for all active proc- 
ess groups is calculates across the entire system, when 
in fact it should be calculated only within boundaries of 
the current processor set. If the total number of shares 
is kept separate for each processor set, then the CPU 
allocation for a given process group will only depend on 
other process groups who have their active processes 
on the same processor set. The work done on other 
processor sets will be unaffected. This is more intuitive 
behavior of such configurations than what it has been 
in the past. 

[0023] The inventive fair share scheduler will now be 
described in detail. According to the inventive concept, 
various processes in a computer system are combined 
into process groups. Each of these process groups is 
assigned a fixed number of shares, which is the number 
that represents relative importance of process groups. 
The number of shares of a process group is used to al- 
locate system resources among processes of that proc- 
ess group executing within a predetermined processor 
set, in the manner described in detail below. Specifically, 
the inventive fair share scheduler considers each proc- 
essor set to be a separate virtual computer, jiff ere at 
p ro cessor sets do notjsha re ^pro cesses, in other word s , 
a pr ocess m ust execute on a single processor se t. 
[0024] In one embodiment of the invention, each proc- 
ess group is given the same number of shares for all 
processor set. It should be noted that even if process 
group has zero shares, processes of this process group 
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may still be executed on processor sets, which do not 
have any other active process groups. Percentage of 
the resources of the specific processor set allocated to 
processes of a particular process group is calculated as 
a ratio of the shares of that process group on the proc- 5 
essor set to the total number of shares of all active proc- 
ess groups operating in that set. The process group is 
considered active on a processor set, if that processor 
set executes at least one process of that process group. 
[0025] Now, an exemplary illustration of the inventive io 
scheduling concept will be presented, see Fig. 1 . In this 
illustration, acomputer system includes five processors: 
p1-p5 and three process groups g1, g2 and g3. The 
processors p1 and p2 constitute processor set s1 . Proc- 
essors p3, p4, and p5 are combined into processor set *5 
s2. Group q 1 is assigne d 3 shares and It is e ^ ecuting^ 
onjhVpocesjSQr se t s1 . Process groups g2 and g3 are 
assigned 1 and 2 shares respectively, and they both ex- 
ecute on processor set s2. 

[0026] The conventional fair share scheduler de- 20 
scribed above would allocate the system resources 
without reference to the processor sets. Specifically, it 
would assign 3/(3+1 +2) = 3/6 = 50% of the entire system 
resources to group g1. Group g2 would receive 1/ 
(3+1+2) = 1/6 = 16.7%, while group g3 would get 2/ 25 
(3+1+2) = 2/6 = 33.3% of the system resources. 
[0027] The inventive system, on the other hand, group 
g1 receives entire processor set s1 , which is 40% = 
100% * 2/5 of the total system resources. Groups g2 
and g3 jointly executing on processor set s2 receive 1/3 30 
* 3/5 = 1 /5 = 20% and 2/3 * 3/5 = 2/5 = 40% of the system 
resources, respectively. It should be noted that the per- 
centage of the system resources assigned to process 
group g1 is independent of the number of the process 
groups and the toad of the processor set s2. 35 
[0028] According to another embodiment, a schedul- 
er for fair share scheduling may be provided for allocat- 
ing a percentage of system resources among process 
groups in a computer system. The computer system 
may include at least one central processing unit, said at 40 
least one central processing unit combined into at least 
one processor set. The central processing units may be 
located on one or a plurality of data processing devices, 
and may communicate with one another through inter- 
nal communication links or external communication 45 
links. 

[0029] The scheduler may be provided as an integral 
part of the computer system, e.g. in the form of a pro- 
gram or group of programs executed on at least one of 
the central processing units of the computer system or 50 
may be provided with at least one external device. 
[0030] The scheduler for may comprise means for re- 
alizing the functionality outlined with respect to the pre- 
vious embodiments. In particular, the scheduler may in- 
clude means for assigning each of said process groups ss 
a number of shares for each or said at least one proc- 
essor set. and means for allocating said system resourc- 
es of each of said at least one processor set to each of 



said process groups according to the number of shares 
assigned to said each of said process groups. 
[0031] Further, the scheduler may include means for 
allocating said system resources of each of said at least 
one processor set based on a number of shares of ail 
active groups within said each of said at least one proc- 
essor set. 

[0032] Moreover, the scheduler may include means 
for calculating said percentage of said system resources 
based on a ratio of the number of shares assigned to 
said each of said process groups to the a number of 
shares of all active groups within said each of said at 
least one processor set. 

[0033] Each of said process groups may include only 
one process. 

[0034] While the invention has been described herein 
with reference to preferred embodiments thereof, it will 
be readily apparent to persons of skill in the art that var- 
ious modifications in form of detail can be made with 
respect thereto without departing from the spirit and 
scope of the invention as defined in and by the append- 
ed claims. For example, the present invention is not lim- 
ited allocating CPU shares only. Generally, processes, 
users, or groups of users can be allocated shares relat- 
ing to various system resources, such as CPU, disk or 
memory usage. In addition, shares need not be as- 
signed to groups of users. According to the inventive 
concept, shares can be assigned to processes, users, 
or groups of users, or in any other manner. It will also 
be appreciated by those of skill in the art, that a number 
of processors in a processor set may not be an integer 
number. 

[0035] In other words, the concept of processor set 
should be viewed, in the context of the present inven- 
tion, as allocating shares of the total processor resourc- 
es of a computer system among virtual computers (proc- 
essor sets), that do not share processes, rather than 
simply combining resources of physical CPUs. Finally, 
each process group could be given a separate number 
of shares for each processor set. 
[0036] Those of skill in the art will undoubtedly appre- 
ciate that the invention can be implemented on a wide 
variety of computer systems including, but not limited 
to, general purpose computers and special purpose 
computers such as network appliances. As well known 
in the art, a computer consists at least of a central 
processing unit, a memory unit, and an input/output in- 
terface. The aforementioned computer components can 
be arranged separately, or they can be combined to- 
gether into a single unit. The computer memory unit may 
include a random access memory (RAM) and/or read 
only memory (ROM). The present invention can be im- 
plemented as a computer program embodied in any tan- 
gible storage medium, or loaded into the computer 
memory by any known means. As an alternative to im- 
plementing the present invention as a computer pro- 
gram, the present invention can be also embodied into 
an electronic circuit. This embodiment may provide an 
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improved performance characteristics. 
[0037] According to another embodiment, a computer 
system may be provided comprising at least a central 
processing unit and a memory, said memory storing a 
program for allocating a percentage of system resourc- 
es among process groups in a computer system, said 
computer system comprising at least one central 
processing unit, said at least one central processing unit 
combined into at least one processor set, said program 
comprising instructions for assigning each of said proc- 
ess groups a number of shares for each or said at least 
one processor set allocating said system resources of 
each of said at least one processor set to each of said 
process groups according to the number of shares as- 
signed to said each of said process groups. 
[0038] Further, it is noted that a computer-readable 
medium may be provided having a program embodied 
thereon, where the program is to make a computer or a 
system of data processing devices to execute functions 
or operations of the features and elements of the above 
described examples. A computer-readable medium can 
be a magnetic or optical or other tangible medium on 
which a program is recorded, but can also be a signal, 
e.g. analog or digital, electronic, magnetic or optical, in 
which the program is embodied for transmission. Fur- 
ther, a computer program product may be provided com- 
prising the computer-readable medium. 



Claims 



groups to the a number of shares of all active groups 
within said each of said at least one processor set. 

4. The method of at least one of the claims 1 to 3, 
5 wherein each of said process groups includes only 

one process. 

5. A computer readable medium embodying a pro- 
gram for allocating a percentage of system resourc- 

10 es among process groups in a computer system, 
said computer system comprising at least one cen- 
tral processing unit, said at least one central 
processing unit combined into at least one proces- 
sor set, said program comprising: 

15 

a. assigning each of said process groups a 
number of shares for each or said at least one 
processor set; 



20 b. allocating said system resources of each of 

said at least one processor set to each of said 
process groups according to the number of 
shares assigned to said each of said process 
groups. 

25 

6. The computer readable medium of claim 5, wherein 
said system resources of each of said at least one 
processor set are allocated based on a number of 
shares of all active groups within said each of said 
30 at least one processor set. 



1 . A method for allocating a percentage of system re- 
sources among process groups in a computer sys- 
tem, said computer system comprising at least one 
central processing unit, said at least one central 35 
processing unit. combined into at least one proces- 
sor set, said method comprising: 

a. assigning each of said process groups a 
number of shares for each or said at least one 40 
processor set; 

b. allocating said system resources of each of 
said at least one processor set to each of said 
process groups according to the number of 45 
shares assigned to said each of said process 
groups. 

2. The method of claim 1 , wherein said system re- 
sources of each of said at least one processor set so 
are allocated based on a number of shares of aN 
active groups within said each of said at least one 
processor set. 

3. The method of at least one of the claims 1 and 2, ss 
wherein said percentage of said system resources 

is calculated based on a ratio of the number of 
shares assigned to said each of said process 



7. The computer readable medium of at least one of 
the claims 5 and 6, wherein said percentage of said 
system resources is calculated based on a ratio of 
the number of shares assigned to said each of said 
process groups to the a number of shares of all ac- 
tive groups within said each of said at least one 
processor set. 

8. The computer readable medium of at least one of 
the claims 5 to 8, wherein each of said process 
groups includes only one process. 

9. A program having instructions adapted to. make a 
computer carry out the method of at least one of the 
claims 1 - 4. 

10. A scheduler for allocating a percentage of system 
resources among process groups in a computer 
system having at least one central processing unit, 
said at least one central processing unit combined 
into at least one processor set, the scheduler com- 
prising: 

means for assigning each of said process 
groups a number of shares for each or said at 
least one processor set; and 
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means for allocating said system resources of 
each of said at least one processor set to each 
of said process groups according to the number 
of shares assigned to said each of said process 
groups. 5 

11. The scheduler of claim 10, including means for al- 
locating said system resources of each of said at 
least one processor set based on a number of 
shares of all active groups within said each of said io 
at least one processor set. 

1 2. The scheduler of at least one of the claims 1 0 and 
11 , including means for calculating said percentage 

of said system resources based on a ratio of the ^ 
number of shares assigned to said each of said 
process groups to the a number of shares of all ac- 
tive groups within said each of said at least one 
processor set. 

20 

13. The scheduler of at least one of the claims 10 to 12, 
wherein each of said process groups includes only 
one process. 

25 
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